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Abstract 

A simple and natural model is introduced for studying a trend in the tail of a probability distribution over 
time (or over space). Estimation and testing procedures are provided with application to rainfall data. 
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1 Introduction 



In the climate change dispute some people suggest ( Klein T ank and Konnen (2003); Groisman et al. (2005); 
Alexander et al. (2006]); Zolina et al. (20091) that perhaps there is no or little change in the mean of the 
probability distribution of daily rainfall over time but there is a change in the tail that is, more extreme events 
occur more frequently. The present paper - like |Smith| ( |1989| ); |Hall and TajvTdi| ( [2000] ); |Hanel et al.| ( [2009] ) - 
considers a trend in extremes from the point of view of extreme value theory. A simple and intuitive way to 
study such a trend is by looking at the tail of the probability distribution at time s > compared to the tail at 
time zero. 

We consider random variables X(s) where s > is time. Write F s (x) := P{X(s) < x} for iel. Assume 
that F is in the domain of attraction of an extreme value distribution (F e V{G 1 ) for some 7 e E) and that 

l-F fl (aQ 
1 - F {x) 

tends to a positive constant for all s > when x tends to the right endpoint x* of F . Hence the exceedance 
probability at time s is systematically a factor times the exceedance probability at time 0. We take for simplicity 
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as our model for relative risk 



lim ) |jffi = e» (1) 



with c a real constant representing a possible trend. This means that e.g. for s = 1 and c = 1 the probability 
of an extreme event taking place at time 1 is e times the probability at time 0. 



First note that by Resnick (1987), section 1.5, relation © implies that F s e 2?(G 7 ) for all s > 0. Recall 
that F s e V{G~ ( ) if and only if for some positive function a s 

km — = , (2) 

t->oo a s (t) 7 

x > 0, where U s is the inverse function of 1/(1 — F s ). 

A slight extension of Resnick's result provides a relation equivalent to for the functions U s : for s > 

lim^-^=g^. (3) 

Note that relation ((3) implies 

]im^ = e»". 
Furthermore, for 7 > 0, relation ([3]) is equivalent to 

lim 7T77Y = eC7S - W 

Relations ([l), ([3]) and @ can be used to build estimators for c. We proceed in a semi-parametric way. 

Suppose that we have repeated observations at discrete time points = s < si < s 2 < . . . < s m . 
It is assumed that {JQ(s,)| are all independent and that Xi(sj), X 2 (sj), . . . , X„(sj) have the same 

distribution function F Sj for all j. Let Xi >n (sj) < X 2 , n (sj) < ■ ■■ < X ntn (sj) be their order statistics. 

(i) If it is known that 7 is positive, relation Q leads to a least square estimator for c: 



J2 Sj (log X n ^ kt n(sj) - logX„_ fe! „(0)) 
c W := >- S • (5) 

We shall discuss the estimator for 7+ := max(0, 7) later on. 
(ii) Relation ([3]) also leads to an estimator for c. Intuitively relation ((3]) means that 

1 

l+7n,fc ' 



"0 



(t) 
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where is an estimator for 7. Define by 



log^l + 7 n , fe f-r^r J -csjj 



«o(f) 



i.e., 



A(2) ._ fcl 



E »j logfi + ^A^^M\ v» 



a (n/fe) 



(6) 



E- 



For 7„,fc = 0, the estimator is defined by continuity, 
(iii) Finally relation leads to an estimator for c. Intuitively relation means that 

, l-F s (X n _ fe ,„(0)) 

log ^- « cs 

l-F (X„_ fe , n (0)) 

where F s is the empirical distribution function at time s. This leads to 



a(3) ._ J 



E iog(i E 

;"=1 v t=l 



{X I (s J )>X„_ fc ,„(0)} 



(7) 



E «i 



The problem of defining and estimating a trend in extreme value theory has been considered by a number 
of authors including |Smith| ( |1989] ) ; |Hall and Tajvidi| ( |2000| ) ; |Coles| ( |2001| ) ; [Yee and Stephenson| ( |2007| ) and 
more recently addressed by Mannshardt-Shamseldin et al. ( 2010| ). A review of results on trend estimation is 
given in the Appendix. 

We have restricted ourselves to the model in ([l]) that is, a trend function of the form e cs , since we are 
interested in a monotone trend and also because a more general change would have been more difficult to 
detect. Figure [T] gives some insight into the difficulty of detecting more complex trend functions, namely a 
temporal trend in the extreme value index 7. We could have studied more general (not monotone) changes 
in a similar manner, possibly with adjustments enabling other appropriate estimation procedures. 

The outline of this paper is as follows. In section [2] we state the conditions for consistency and asymptotic 
normality of the estimators introduced in (|5]), ([6]) and ([7]). Proofs are postponed to section [5] In section 
[3] we collect some simulation results for illustrating and assessing finite sample performance of the various 
estimators for the trend. In section[4]we apply the methods to daily rainfall at 18 stations across Germany and 
The Netherlands and give a tentative interpretation of the results. Indeed for some stations the probability of 
extreme rainfall has increased by about 2% in each decade. 
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Figure 1: Estimates j(sj) for each one of the m+1 = 18 time points Sj — j/m, j — 0, 1, 2, . . . , m, at one particular location, a gauging 
station in The Netherlands. The number k corresponds to the number of observations above the hight random threshold X n -k, n (sj) 
(Left). Corresponding estimates y(sj) with fc = 30 (Right). 
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2 Results 

Let us consider cS 1 ' first and suppose 7 > 0. Let 7^ fc (sj) be an estimator for 7+ based on the observations at 
time Sj . Since 7+ does not depend on s, we use a combined estimator 

^V=-E%t^-). (8) 

We consider estimators %^ fe (sj) that are consistent, i.e., if ((4]) holds and the number k = k n of upper order 
statistics used satisfies fc = fc n 00, k n /n — > 0, n 00, then jt k (sj) —> 7 + . 

' n— >oc 

Next we discuss the asymptotic normality of 7+^. Consider a second order condition for U : suppose there 
exists a positive or negative function f3 with lim^oo /3(t) = such that for x > 

hm ^— = x 7 — - — (9) 

i^oo /3(f) p 

with p a non-positive parameter. Further we need a second order strengthening of condition Q : suppose 
that for all j 

v. At) + 

Um Uo{t) - = e ? = ■ (10) 

t^oo /3(f) p 
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We consider estimators J^ k {sj) such that 

v^kte) - io g x„_ fein ( Si ) - iog^(^)) n -A (r+( s ,), (ID 

say, for all j, where (r+(sj), B + (sj)) has a multivariate normal distribution provided k (the number of upper 
order statistics used in 7^ fe (sj) for all j) satisfies k = k n —> oo and 

lim \l k„ Bi - — ) =: A 



C) 



exists finite. Various estimators j„ k (sj) are known with this property, notably Hill's estimator (Hill ( 1975 )). 
Next we consider c^ 2 \ Consider for each j estimators % t k(sj), a Sj (n/fc) with the following properties. 

a. Under the conditions F e £>(G 7 ) and ([p 



a S] (f ) 



provided k = k n — > oo, k n /n — > 0, n — > oo. 

b. Consider the following second order strengthening of the two conditions in a: there exists a positive or 
negative function a with lim t _ i . 00 a{t) = such that for each j and x > 

U (tx)-U (t) _ xZ-1 , 

lim , , 3L_ = 1 ( Xl+P - 1 _ ^) =: tf 7 „ (af) (13) 

t^oo a (t) P\ 1 + P 7 / 

where p is a non-positive parameter. Further: 



U Bj (t)-Ua(t) _ gc^j-l 
<Jo(t)_ 



lim " oW 1— = ff 7 , P (e cs 0. (14) 



Under conditions ( [13] ) and ([14]) 

^fo> "7.^-1, ^ 

-A (r( Sj ), s( Sj )), 

n— foo 

say where (r(sj), A(sj), B(sj)) are independent random vectors and have a multivariate normal distri- 
bution for each j provided k = k n — >• oo, and 

lim y^k^al - — ] =: A 



5 



Results 



exists finite. Various estimators are known with these properties. 
Theorem 1 1. Assume 7 > 0. Under condition ^ 



c v ; — > c. 



Under conditions ((9]) and ( |T0 



d j=i v 



-A(0) _j_ e e " e i-l ^ 



1 P 



2. Under condition ([!]) 

c (r) Ac /or r = 2, 3. 



[/nder conditions JT3] > and ( fT4l ) 

3=1 ' i=l 

+ e" c ^ (e^'-Bfo) - 5(0) - 6 A(0) +A^ p (e cs o) } / 1>. 

7 3=1 

and 



^( C ( 3 ) -c) n -A " U " ;V - ; - (1) + e~ cs >b 3 ( Sj ) \X / ]T S 2 , 

where {lf' s ''(t)} (>0 are independent standard Brownian motions (j = 1,2, . . . , m) and 

0, p = 0. 

Corollary 2 Assume c = 0. Under the conditions of the Theorem, 
1. if k — k n is such that vfc j3(n/k) — >• 0, as n — >• oo then 

Q a) n := £ * | iogX», fc , K ( Sj -)-iogX„_ fc ,„(o) j 2 ^ X 2 (m); (15) 



3=1 



2. if k = k n is such that \fka(njk) — > 0, as n — > oo then 



£>»:=£f lE^)>^,. (0)}-l (16) 
j=i j=i ' 
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Here x 2 (m) is a standard chi-squared distributed random variable with m degrees of freedom. 

Corollary [2] gives rise to a testing procedure for detecting the presence of a trend in the tail of the underlying 
distribution functions F s all lying in the same domain of attraction. That is, Qm] n , r = 1,2, defined above 
can be used as test statistics to evaluate the null hypothesis H : c = against the alternative H\ : c 7^ 0. 
Whence H Q should be rejected at a significance level a e (0, 1) for any observed value of Qm]n verifying 
Qobs > 9i-ct( m )> tne latter being the (1 — a)-quantile pertaining to the chi-squared distribution with to 
degrees of freedom. 

3 Simulations 

Simulations have been carried out for three distributions, the generalized Pareto distribution, the ordinary 
Pareto distribution and the Cauchy distribution. The number of locations is 200 (i.e. m = 200) with Sj = j/m, 
j = 1,2,..., to. At each location there are 500 i.i.d. observations (n = 500). Then there are 1000 replications. 

The generalized Pareto distribution (GPD) with distribution function 1 — (1 + 72;) -1 / 7 for those x for 
which 1 + 72; > has been considered with 7 = —0.1, 0.1 and 0.5 (since 0.1 is a typical value for rainfall, the 
application topic). Relation fl!3| ) holds with limit zero since the left hand-side is zero (exact fit). The values 
c = —0.1 and c = 0.1 have been considered. 

The starting point in each case is a r.v. X from the GPD distribution. For location Sj we then take 
X{ Sj )=e cs ^X + (e cs ^ - 1) / 7 . That way the relations Q and (H) hold. 

Figure [2] displays the average (over 1000 replications) values of the estimators (only for positive 7), 
tP^ and as functions of the number k of upper order statistics used in the estimation. As usual in graphs 
of this type there is a stretch of the graph that is more or less straight; the idea is that in that part both the 
variance and bias are not too high. The estimators and seem to give the best performance. 

In Figure [2] the extreme value index 7 and scale a have been estimated by the moment estimator. Figure 
[3] gives a comparison with the maximum likelihood estimator. 

For the ordinary Pareto distribution with distribution function 1 — x" 1 ! 1 , x > 1, 7 > 0, we simulate the 
trend by taking X(sj)= e CSjl X where X follows the Pareto distribution. Again we have an exact fit. 

For the Cauchy distribution again the trend is simulated by taking X(sj)=e CSj X (since 7 is 1). Relations 
fl!3| ) and fl!4D hold. In this case |a| is a regularly function with index —2 (rather fast convergence). Figure[4] 
displays the simulation results for the Pareto and Cauchy distributions. 

A much more comprehensive simulation study has been performed. From the described simulations and 
the other ones we conclude that the estimators perform reasonably well. Estimators and seem to 
behave better than In the next (application) section we shall emphasize the smoother c^ 2 \ 
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Figure 2: Estimated means of £S T \ r = 1,2, 3, plotted against the same number k of top observations on each location Sj = j/m, 
j = 1, 2, . . . , m, with underlying Generalized Pareto distribution, in either case of true value c = —0.1 or c = 0.1. 
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Figure 3: Estimated means of c' 2 ' either with Moment estimator and ML estimator for the Generalized Pareto distribution with 
7 = 0.1 and c = 0.1. The number of samples amongst the 1000 replicates that have produced valid ML-estimates is presented on the 
right hand-side. 
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Figure 4: Estimated means of c^ r \ r = 1,2, 3, plotted against the same number k of top observations on each location Sj — j/m, 
j = 1, 2, . . . , m, in case of true value c = 0.1, with Pareto and Cauchy parent distributions. 
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Figure 5: Selected gauging stations in The Netherlands (left); selected gauging stations in Germany and Dutch stations near the 
borderline (right). 




4 Data Analysis 

As an application of the tail trend assessment methodology developed in this paper, we will look at daily 
rainfall totals collected in 18 gauging stations across Germany and The Netherlands, comprising latitude 47N- 
53N and longitude 5E-13E. The geographic location of the stations is displayed in Figure [5] Rainfall data 
are from the European Climate Assessment and Dataset. We note that different stations suffer from different 
coverage in time in the sense that not all stations have started regular recording of data at the same year. 
Moreover there are some stations with missing values. All of them however meet the basic criterium for 
completeness that there is less than 10 days missing per year which leaves us with 90 years of complete data 
from 1918 up to 2007. 

Figure [6] displays yearly maxima plots for several stations on the basis of available data. We get a mixed 
picture. In STN41-Halle, for instance, precipitation does not seem to be as severe now as in the first half of 
the 20th century anymore, whereas STN39-Dresden shows increasingly annual maxima with the largest peak 
of 158 mm of rain, spot on the catastrophic event of 12 August 2002. 

As long as there is at least one day in between, there is not much dependence in the amount of rainfall on 
two different days. For each station we select first the highest observation. Then we remove the observations 
on the day before and after. Next we select the highest observation from the remaining data, etc. This goes 
on until we have selected 70 days or the threshold of 1 mm is reached. That way we get a sequence of higher 
order statistics from i.i.d. data. Table [T] displays the number of rain days (i.e. with at least 1 mm of rain) per 
station. 
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Table 1: Total number of rain days in the period from 1918 to 2007 for each selected station. 



STN 


Name 


Country 


Lat. 


Lon. 


Rain days 


36 


Bamberg 


D 


48°49'N 


11°33'E 


6276 


37 


Berlin 


D 


52°31'N 


13°20'E 


6270 


39 


Dresden 


D 


51°31'N 


13°44'E 


6293 


40 


Frankfurt 


D 


50°6'N 


08°40'E 


6069 


41 


Halle 


D 


51°28'N 


11°57'E 


6230 


42 


Hamburg 


D 


53°33'N 


09°59'E 


6125 


44 


Hohenpeissenberg 


D 


47°48'N 


10°59'E 


6230 


48 


Mtinchen 


D 


48°08'N 


11°34'E 


6153 


49 


Miinster 


D 


52°59'N 


07°41'E 


5904 


52 


Stuttgart 


D 


48°46'N 


09°46'E 


6289 


119 


Ter Apel 


NL 


52°53'N 


07°04'E 


6069 


120 


Heerde 


NL 


52°24'N 


06°03'E 


6300 


121 


Winterswijk 


NL 


51°59'N 


06°42'E 


6298 


122 


Kerkwerve 


NL 


51°40'N 


03°51'E 


6276 


123 


Westdorpe 


NL 


51°13'N 


03°52'E 


6300 


125 


Roermond 


NL 


51°11'N 


05°58'E 


6300 


128 


De Bilt 


NL 


51°06'N 


05°11'E 


6299 


129 


Eelde 


NL 


53°08'N 


06°35'E 


6300 



Figure 6: Some yearly maxima of daily rainfall. 

STN1 23 - Westdorpe STN 128 - De Bilt 
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Since we are not looking for a spatial trend now we shall make a study of the highest daily rainfall amounts 
in the 90 year period for each station separately. At each location, for 7+ (in connection with c^) we use 



Hill's estimator and for 7 and So (for c^) we use the moment estimator (cf. sections 3.5 and 4.2 of deHaan 
land FerreiralQ2006D ) 

The point estimation of the extreme value index 7 and trend estimation is conducted with the same 
number of upper order statistics k, just as prescribed in each definition of c^ r \ r = 1,2,3, introduced in ([5]), 
([6]), and ([7]), respectively. 

In order to have enough tail related rain measurements per time point we found reasonable to take 
consecutive intervals of 5 years over the 90-year span. The disjoint intervals serve as our time points indexed 

by j = 0, 1,2, . . . , 17 = m. 

For the purpose of data analysis, the gauging stations have been divided into two groups, determined by 
their alignments in the general climate characteristics (according to the Koppen-Geiger climate classification 



system, see e.g. Kottek et al. (12006 1). Each selected station in Germany was classified as either humid oceanic 



or humid continental. All stations across The Netherlands are classified as humid oceanic. Although we are not 
looking for spatial coherence we hope to benefit from this information to get a more systematic presentation 
of our results. 

Estimates of 7 in a vicinity of 0.1 often emerge in connection with the extremal behavior of distributions 



underlying rainfall records (see e.g. Buishand et al. (2008), p. 239; also Mannshardt-Shamseldin et al. (20101 



p. 492). This seems to hold for most of the considered stations although there is a lot of variation. 

Figure [7] includes sample paths of the three proposals for estimating the tail trend parameter c <E K for 
some typical gauging stations. As already discussed, we shall handle estimation of c by screening plots as 
in Figure [7] for plateaus of stability in the early part of the graphs pertaining to the smoother estimator c^ 2 \ 
coherent with the path patterns of and whenever possible. 

Table[2]contains the estimated values of c for each station by their increasing order of magnitude. Standard 
errors are also provided. Bearing in mind the simulation results from section[3j here we shall confine attention 
to the leading estimator c&K Since the combined moment estimator 7 = 7^ is a consistent estimator for 7, 
the asymptotic standard error of = c can be estimated by 



-78sj -JCSj) 2 0-£(7) 



k l-^-f J L 7 4 m 



where Sj := Sj/ s 2 , 

3=1 



4(7) 



V 



I+7 2 , 7 >0, 

(l~7) 2 (l-27)(l-7+67 2 ) „ ^ n 
(l-3 7 )(l-4 7 ) ' ' < U 
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Figure 7: Sample path of the overall moment estimator for 7 and sample trajectories of &- T \ r = 1, 2, 3, as a function of the same 
number k of top observations for each 5-year interval between 1918 and 2007 for several stations. 





STN39 - Dresden 




-0.5 -i 1 1 1 1 1 1 1 1 1 1 1 1 
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Table 2: Station-wise estimates for the trend parameter c by adopting estimator c 1 - 2 '. 



Oceanic Stations 











Station 


c 


s.e.(c) 


7 










STN 49 


0.16 


0.122 


0.12 


Continental Stations 






STN 40 


0.2 


0.131 


0.1 


Station 


c 


s.e.(c) 


7 




STN 123 


0.2 


0.113 


0.19 


STN 41 


-0.2 


0.095 


0.2 




STN 52 


0.25 


0.120 


0.08 


STN 44 


-0.1 


0.101 


0.15 




STN 122 


0.55 


0.131 


0.1 


STN 48 


-0.1 


0.109 


0.13 




STN 119 


0.7 


0.122 


0.15 


STN 37 


-0.06 


0.122 


0.2 




STN 120 


0.7 


0.132 


0.15 


STN 39 


0.25 


0.116 


0.19 




STN 125 


0.8 


0.207 


0.08 


STN 36 


0.32 


0.116 


0.18 




STN 128 


0.8 


0.261 


-0.05 










STN 121 


0.9 


0.290 


0.05 










STN 42 


0.9 


0.325 


0.05 










STN 129 


1.0 


0.221 


0.07 



<(7) := 



and 

' 2 + 7 2 , 7 >0, 

2-167+5l7 2 -697 3 +507 4 -247 5 . p 

( l_2 7 )(l-37)(l-47) ' 7 ^ U - 

It remains to assess whether the stations with near zero estimates in fact have a null trend. Examples are 
STN37-Berlin, STN48-Miinchen, STN49-Miinster, STN52-Stuttgart and STN39-Dresden. The last site we refer 
to is STN40-Frankfurt, where testing for the presence of a trend is also of practical importance given the poor 
circumstances involving the estimation of the parameter c: the erratic sample paths displayed by the three 
estimators often cross the c = line (cf. Figure [7]). In the case of STN40-Frankfurt it seems difficult to find 
a "plateau of stability" in Figure [7J the estimate c = 0.2 is rather uncertain. Therefore, we aim at a more 
definite decision on the value of c by means of a testing procedure upon STN40-Frankfurt in particular. 

In order to tackle the problem of testing the presence of a trend in time, i.e., the problem of testing 
hypothesis 

H : c = versus Hx : c ^ 0, (17) 

we shall use Qm,n from corollary [2] as our test statistics. Hence, for r = 1, 2, the null hypothesis H : c = 
is rejected in favor of the two-sided alternative Hi : c ^ if Qm]n > q m ,i-a, where q m ,i-a denotes the 
(1 — a)-quantile of the chi-square distribution with m degrees of freedom. 

Figure ^ depicts the sample trajectories pertaining to the two-sided test statistics Qm]n and Qm] n in com- 
panion with critical values at the nominal size a = 0.05 and with respect to the referenced stations STN37- 
Berlin, STN48-Munchen, STN49-Munster and STN40-Frankfurt. It seems that the hypothesis of no trend has 
to be rejected in case of STN40-Frankfurt. The two tests also ascertain a non-null trend for STN48-Miinchen 
and STN49-Miinster. There is no evidence of a particular trend at STN37-Berlin. 

Broadly speaking, the stations in the Oceanic group present higher positive values of c (but not always 
smaller values of 7) than the Continental stations. Overall, fairly positive values of c may be interpreted as 
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Figure 8: Sample paths of Qx8 n an ^ Qw.n test statistics and corresponding critical values for the two-sided test at a significance level 
a = 0.05 (xq 95(17) — 8.67) for several stations across Germany. 



Q1 test statistic 



Q2 test statistic 





being influenced by the ocean. 



Allen and Ingram ( 2002 ) describe how the intensity of extreme rainfall events depends on the availability 



of moisture. Because moisture availability is constraint on temperature (through the Clausius-Clapeyron rela- 
tionship), an increase in rainfall extremes is expected in a warming climate. Lenderink et al. (|2009) show that 
higher increases can be expected at locations that are under the influence of the sea. Increasing sea surface 
temperatures contribute to higher rainfall amounts. The results obtained in this study for the behavior of 
extreme rainfall at locations in Germany and The Netherlands are consistent with these findings. Overall, the 
Oceanic group of stations shows a stronger increase in extreme rainfall than the Continental group of stations. 



Next we discuss the interpretation of c. If c = 1.0 then in view of the fact that s is measured in periods of 5 
years, the probability of really heavy rainfall increases approximately by 12.5% during each decade. Similarly 
if c = 0.1 then the probability of heavy rainfall increases in the same period approximately by 2%. These 
results are in good agreement with the positive trend of 3% per decade found by |Zolina et al. (2009 1 for the 
second half of the century (1950-2000) using a different metric. 
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5 Proofs 



We shall use the following representation: 



{^n-i,n(Sj)}" =1 ={C Sj (y n -i, n (Sj))}™ =1 

where {Y n - i}n (sj)}f =1 are the n— th order statistics from the distribution function 1 — a: -1 , x > 1, indepen- 
dently for j = 1, 2, . . . , m. 



de Haan and Ferreira 



Proof of consistency 

For the consistency of note that (fc/n) Y n -k, n (sj) — — > 1, n — > oo (cf. 

2.2.2) and that lim^oo U s (tx)/U s {t) = x 1+ locally uniformly for x > 0. Hence for j = 1,2,. 

Ti p 

\0g X n -k, n {s j) -\0gU s ( T ) > 1. 



(2006), Corollary 



The rest is easy. 

Similarly with respect to we get that 



X n — k,n( s j) ~ ^sj(f) 

Mi) 



(18) 



This limit relation combined with relation ((3]) leads directly to the consistency of . 

With respect to the consistency of we begin by noting that the domain of attraction condition 



n 

lim -P{ 

n—t-00 f£ 



for all I = 1, 2, . . combined with ([p implies 



lim e- cs > -P 

n— too h 



{Xiisj) > [/oQ+ Mo QJ = (1 + 1 x)-^\ 
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j = 1, 2, . . . , m. Hence the characteristic functions converge: 

t n 

E 6XP { i k ^ I{Xl (Sj ) > u ° {n,k) +x a " {n/k) } I 
1=1 

ft \ n 

( ScX p{*fc / {X 1 ( S3 )>£/ (n/fc)+a :(I o(n/fe)}}J 

= (e«*p{x 1 (* J )>Db(5)+ a! ao(2)} 

+ l-p{x 1 ( Sj )>C/ (|)+a ; ao(|)} 
/ eCS , fc(eit/fc _ 1} e-^(n/k)[l-F S] (U (n/k)+ X a (n/k))] 

n— too I J 

for every t € K. Owing to Levy's continuity theorem, the latter implies 

1 " p 

I J {^(^)> U Q (n/k)+x a Q (n/k)} f?* 3 (1 + 7 x)- lh . 

Next use (fT8i □ 



For the proof of the asymptotic normality we need an auxiliary result. 
Lemma 3 Assume conditions ( fT3"l ) and ([14]). Define 



a s (t) ■= e csp a{t), 

/ e csp — 1\ 

a s (t) := e cs %,(i)(l + a(t) J. (19) 



Then /or s € K and x > 

C7 a (ts)-C7 s (t) _ x T -l 
a s (t) 7 



lim ^' „ = ff 7 , P (x). (20) 



Proof: For simplicity we write d for e cs in this proof. 
First note that ( [T3] ) implies that 



U 3 (tx)-U s (t) _ , 7 a-v-l Up(tx)-Uo(t) _ a T-l a Q (tx) _ 7 

ao(t) " 7 = ao(t) 7_ , goffl _ « 7 ~ 1 

a(t) a(t) a(t) 7 

/ a(tx) a (tx) 



a(t) a Q (t) 



l)H JjP (d) (l + o(l)) 
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Proofs 



converges to 

r p — 1 |7 7 — 1 

H„ Jx) + x> + {x->+p - 1)F 7 p (d). 

P 7 



Next write 

a.(t) 7 _ "(0 / Oo (t) a 7 flo(t) X 7 - 1 „,(t) 1 



a s (t) a s (t) \ a{t) a s (t) 7 a(t) 

This converges to 

( E 1 p (x) + £< X -^± d —± + (x^ + " - 1)H 7 „(d)) - 

\ PI 'IP' 

which is equal to H l p {x). 



Remark 4 The analogue of Lemma ^stemming from conditions ^ and ( [10] ) - i.e. 7 > - holds with the 
auxiliary function (3 s (t) := e csp (3(t). This leads to the following relation for every s € K, 

hm — =ic' — - — , x > 0. 

Proof of asymptotic normality 

We write 



ii m — 1 

' n , fe ' j — 1 

{m 
Sj \yk (logX n - k}n (sj) - \ogU s 



/ fc(logX„_ fc ,„(0)-logi7o(^, 
-^V 7 ^/?^ l °SU Sj (n/k) -logU (n/k) 



The result follows from ([TT]) and ((To). 

For c^ 2 ' it is sufficient to consider 



vUlogll + 7„, fe 1 f-T^r ! -cs4 

1 v a o(k) ' } 
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Proofs 



for j = 1, 2, . . . , m where 7„ ifc = l/m^™ j %,k(sj)- We use Cramer's delta method. 

log(l + 7^ = - log(l + 7 z)) 

(which is x 2 /2 for 7 = 0) and 

^ , a 1 

log(l + jx) i = 



dx 1 + 7X 

(= 1 for x = 0). Further we write 

X n -k,n{Sj) — X ra _fc ; „(0) 

fio(l) 

a (f) \ao(f) a S3 (f) 



implying 



Hence, 



X n - fc , n (0)-^ (^) ^ 3 (f)-L/o(f) 
«o(f) «o(f) 



^-t, n («j)-V*,n(0) e c ^-l 



«o(f) 7 



7 



- Vfc Uo(f) l ) + «o(t)lao(f) ^ ~" a,(f) 
%/j : *n- fc ,n(0)-EME) | ^/ ^(f)-^o(l) e^-l \) 
a (f) V ao(f) 7 //' 



^/ X n _ fc , n ( 3j -)-X n - fc , n (0) _ e c ^ -l \ 
V «o(f) 7 J 

A(0) + e c ^B{ Sj ) - B(0) + A H, ( , p (e cs >). 



«o(f) 

d e c ^ - 1 
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Proofs 



Next we apply the delta method: 



HI) 



log 1 + 7 



e c ^ - 1 \ i 



n 



d ^ 

n— f oo 



The result follows. 



log(l + jx) 1 



<9.x 



log(l + 72;)' 



e c7^ _ ! 



A(0) 



+ e c ^B( Sj -) - 5(0) + Ai/ 7:P (e cs 0} 



B(0) - ^ 1 A(0) + XH 1 . p (e c ^)}. 

7 ' > 



□ 



Finally for consider 
i 



\Xi(s)>X n - h ,n(0) 

1=1 



1 ™ 



(21) 



We write 



x n {s) 



X n - fc , n (0)-t/*(a) 



with U* and a* from Corollary 2.3.7 of de Haan and Ferreira|p006 l, coupled with lemma[3j 



U s (t), otherwise, 



e c ^a (t)(l-ia(i)), 
a*(t) := \ e -T ao ( t )[i_(e^l +cs ) 

a (i)(l + csa(t)), 



P <0, 
«(<)], P = 0^ 7 , 
7 = p = 0, 



and write (I21D as 



{l-4 s) (^(|)+-n( S )a*(f))}. 



where i* 1 ,^' is the empirical distribution function of the random sample Xi(s), X 2 {s), . . . ,X n (s). We consider 
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Proofs 



the two parts separately. By Q141 ) and Theorem 2.4.2 of de Haan and Ferreira (2006), 



«o(f) 

W(t) 



e CS7 - 1 



«o(f) 



l)-Aff 7 , p (e cs ) + A&*( S )} 



(22) 



where 



p V 7+P 
1 e cs ^-l 



P 7 

_ 

7 



- ), 7 + P^0,p<0, 
p = -7 < 0, 



(cs) 2 , 7 = p = 0. 



Further by Theorem 5.1.2 of de Haan and Ferreira (2006), since x — x n is asymptotically constant. 



H-\i-Fi 



(u*(^)+x n (s)a^))j-(l + jx n (s)) - 



-Vka*(-)(l + ^x n (s)) -> * 7 , p ((l + 7 a;„( S )) : 
for a sequence of standard Brownian motions {Wn {t)}t>a and with 



«:(*) = < ^ aW; p<0 ' 

e csp a(t), p = 0, 



and 



x-> +p 
7+P 

logs 

1 

7 



7 + P^O, P <0, 
7 + p = 0, p<0, 
x 7 log a;, p = 7^ 7, 



i(logx) 2 , p = 



7- 



Since by ( [22] ) 



e -c S7 _ 1 



7 



>/fc{(l + 7*n(*)) 7 " (i+7 

-e cs(7+1) (e- CS7 {iy (0) (l) - AH 7 , p (e cs ) + A6*(s)}) : 
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Sketch of alternative approaches 



Vk 



{i (»-*'«) +M.K(=)))- 



A W {s) (e cs ) - e cs iy (0) (l) + Xe cs (h~ Je cs ) + e CS7 * 7 „(e~ cs ) - b*(aj) . 

Proof: [of Corollary|2j By virtue of Rao's theorem ( |Rao| ( [1973j ), Section 3.b.4; see also |Serfling] ( |2002p , p.128) 
pertaining to quadratic forms of asymptotically normal random vectors, statements Q15D and Q16] ) follow im- 
mediately from the theorem. □ 



A Sketch of alternative approaches 

The subject of extreme value theory (EVT) is the study of the right (or left) tail of a probability distribution 
near the endpoint. Hence by nature EVT is an asymptotic theory The basic assumption is 

r max(Xi, X 2 , ■ ■ ■ ,X n ) — b n , ( , \-i/ti rocn 

hm P\ < x) — exp-^ — (1 + 71 ' ' \, (23) 

n->oo a n 

where X\, X 2 , ■ ■ ■ are i.i.d. random variables. It follows that the limit distribution has only one parameter, 
the shift b n e K and scale a n > are not parameters of the limit distribution. They depend essentially on the 
distribution of X\ . 

When it comes to statistics there are three basic methods: 

1. Yearly maxima (or block maxima). Over a number of years one takes the yearly maximum. Since the 
yearly maximum is taken over many underlying random variables (albeit not i.i.d.) the assumption is that the 
yearly maximum Mj can be considered the maximum over a large number n of i.i.d random variables so that 

where n is unknown. The random variables Mj are i.i.d.. The right hand-side can then be interpreted as 
a parametric model (GEV: Generalized Extreme Value distribution) so that e.g. the method of maximum 
likelihood can be applied. 

The interpretation of b n is: the level that has a return period (the mean time between consecutive ex- 
ceedances of the level) of e/(e — 1) rj 1.58 years. Hence there is no direct intuitive meaning for b„. Also 
the behavior of b n as n — > 00 can not be found. This method carries a bias stemming from replacing an 
approximate equality with a firm equality. In contrast to the next case it seems difficult to control that bias. 

2. Peaks over threshold . The basic assumption Q23| ) implies that with b(t) = b^, a(t) = aw and [t] the 
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Sketch of alternative approaches 



integer part of t for x > 



lim P' 

t— >oo 



{ J ^£>x\x 1 >b( t )} = (i +7 xr^ 



i(t) 



(24) 



Select out of n i.i.d. observations the ones that are larger than b(t). These are approximately i.i.d. and (when 



normalized) follow approximately the GPD distribution 1 — (1 + 72;) (Pickands (1975)) 



One can take for b(t) one of the order statistics J n _fc , n . In order to get meaningful results we need to have 
k = k n and k n — » 00, k(n)/n — > as n — > 00. Then X n _ k , n is close to b(n/k), i.e., the quantile F*~(l — k/n). 
Again, since for x > b(t) 



p{x 1 >x\x 1 >b{t)}*(i+ x —^y 1 



-l/ 7 



(t) 

one can consider the right hand-side as a parametric model so that e.g. the method of maximum likelihood 
can be applied. 

Next one can prove that the obtained estimators are consistent and asymptotically normal as n, the number 
of observations, tends to infinity. That is, the vector 



/< l^f|-l,7n, fc -7 



has asymptotically a normal distribution( |Smith| ( |1987| ; |Drees et a l. (2004); |Zhou| ( |2009j )). There are also 
methods to minimize the bias by choosing the number k appropriately. 

3. Point process convergence. Suppose that the basic assumption holds. Take the point process on R 2 with 
points 

r / i Xi -LnI" 

(25) 



V n ' a„ I 



This point process converges in distribution to a Poisson point process on (0, l)xR with intensity measure 
dt ■ (1 + "/x)~ 1 / 1 ~ 1 dx (cf. Pickands (1971 1). Note that the intensity measure is unbounded. Those points in 



for which {Xi — b n )/a n exceeds some threshold u are approximately points from a Poisson point process 
with (finite) parametric intensity measure so that the method of maximum likelihood can be applied supply- 
ing estimators for 7, b n and a n (see Smith (19891, cf. Coles (2001), Chapter 7). No asymptotic behavior 
(tl — y 00, u = u n decreasing) seems to be known for these estimators. 



The three methods have been explained in detail in the book of Coles (2001 ). A trend in the EVT analysis 
has been considered in all three methods. 



1'. Chapter 6 of Coles (2001 1 book treats trends in the block maxima / GEV setup. One considers time 
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Sketch of alternative approaches 



points j = 0, 1, 2, . . . and assumes (for example) 



or/and 



bn(j) =b n (0)+jc 



log a n (j) = log a n (0) +jc. 



As we saw before, b n is the level that has a return period of just e/(e — 1) years. The scale a n can be 
interpreted with some liberty as a derivative, i.e., speed of change of location. The interpretation of both 
seems less straightforward than that of Q. 

There is also another complication. If one is interested in the location parameter over a longer period, say, 
of 2 years i.e. n replaced with 2n, the relation is 



b2n(j) 



K{j) +a n (j) 



T - 1 



MO) + j c + (a n (j) - o n (0)) 



T< - 1 
7 



This is no longer a linear trend in general. Note that our framework, combining trends in location and scale, 
is not bound to a certain period. 



2'. Peaks over threshold . Davison and Smith ( 1990| ) consider a linear trend in both 7 and a(n/k). Coles 



( 2001| ), p. 119, considers a linear trend in log a{n/k). Estimation is done by maximum likelihood. No asymp- 
totic analysis of the quality of the estimators as the number of observations tends to infinity is made. Again 
the interpretation of a trend in a(n/k) or log a(n/k) seems difficult. 



Hall and Tajvidi (2000) consider a nonlinear trend in 7. The method is likelihood based. No large sample 



results are given. 



3'. [Smith] ([1989]) (c.f. |Coles| ( |200ip p.133 sqq.) considers a trend in the location 



b n (j) = b n (0)+jc 



(simplified) in the point process model. Estimation is by maximum likelihood. No asymptotic analysis (n — > 
00) is made. Another possible problem is that the trend for b n (j) could get out of range if 7 < 0. 

In short: the present paper looks at changes in (tail) probabilities whereas in the literature changes in 
various quantiles have been considered. The two viewpoints are not equivalent. 
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